Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 490
Filtrar
1.
Nat Commun ; 15(1): 2356, 2024 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-38490991

RESUMO

Machine learning applied to large compendia of transcriptomic data has enabled the decomposition of bacterial transcriptomes to identify independently modulated sets of genes, such iModulons represent specific cellular functions. The identification of iModulons enables accurate identification of genes necessary and sufficient for cross-species transfer of cellular functions. We demonstrate cross-species transfer of: 1) the biotransformation of vanillate to protocatechuate, 2) a malonate catabolic pathway, 3) a catabolic pathway for 2,3-butanediol, and 4) an antimicrobial resistance to ampicillin found in multiple Pseudomonas species to Escherichia coli. iModulon-based engineering is a transformative strategy as it includes all genes comprising the transferred cellular function, including genes without functional annotation. Adaptive laboratory evolution was deployed to optimize the cellular function transferred, revealing mutations in the host. Combining big data analytics and laboratory evolution thus enhances the level of understanding of systems biology, and synthetic biology for strain design and development.


Assuntos
Escherichia coli , Biologia Sintética , Escherichia coli/genética , Escherichia coli/metabolismo , Genes Bacterianos , Pseudomonas/genética
2.
Artigo em Inglês | MEDLINE | ID: mdl-38439699

RESUMO

The demand for discovering novel microbial secondary metabolites is growing to address the limitations in bioactivities such as antibacterial, antifungal, anticancer, anthelmintic, and immunosuppressive functions. Among microbes, the genus Streptomyces holds particular significance for secondary metabolite discovery. Each Streptomyces species typically encodes approximately 30 secondary metabolite biosynthetic gene clusters (smBGCs) within its genome, which are mostly uncharacterized in terms of their products and bioactivities. The development of next-generation sequencing has enabled the identification of a large number of potent smBGCs for novel secondary metabolites that are imbalanced in number compared with discovered secondary metabolites. The clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated (Cas) system has revolutionized the translation of enormous genomic potential into the discovery of secondary metabolites as the most efficient genetic engineering tool for Streptomyces. In this review, the current status of CRISPR/Cas applications in Streptomyces is summarized, with particular focus on the identification of secondary metabolite biosynthesis gene clusters and their potential applications.This review summarizes the broad range of CRISPR/Cas applications in Streptomyces for natural product discovery and production. ONE-SENTENCE SUMMARY: This review summarizes the broad range of CRISPR/Cas applications in Streptomyces for natural product discovery and production.


Assuntos
Produtos Biológicos , Streptomyces , Streptomyces/genética , Streptomyces/metabolismo , Sistemas CRISPR-Cas , Engenharia Genética , Genoma Bacteriano , Produtos Biológicos/metabolismo , Edição de Genes
3.
mSystems ; 9(3): e0094223, 2024 Mar 19.
Artigo em Inglês | MEDLINE | ID: mdl-38323821

RESUMO

There is growing interest in engineering Pseudomonas putida KT2440 as a microbial chassis for the conversion of renewable and waste-based feedstocks, and metabolic engineering of P. putida relies on the understanding of the functional relationships between genes. In this work, independent component analysis (ICA) was applied to a compendium of existing fitness data from randomly barcoded transposon insertion sequencing (RB-TnSeq) of P. putida KT2440 grown in 179 unique experimental conditions. ICA identified 84 independent groups of genes, which we call fModules ("functional modules"), where gene members displayed shared functional influence in a specific cellular process. This machine learning-based approach both successfully recapitulated previously characterized functional relationships and established hitherto unknown associations between genes. Selected gene members from fModules for hydroxycinnamate metabolism and stress resistance, acetyl coenzyme A assimilation, and nitrogen metabolism were validated with engineered mutants of P. putida. Additionally, functional gene clusters from ICA of RB-TnSeq data sets were compared with regulatory gene clusters from prior ICA of RNAseq data sets to draw connections between gene regulation and function. Because ICA profiles the functional role of several distinct gene networks simultaneously, it can reduce the time required to annotate gene function relative to manual curation of RB-TnSeq data sets. IMPORTANCE: This study demonstrates a rapid, automated approach for elucidating functional modules within complex genetic networks. While Pseudomonas putida randomly barcoded transposon insertion sequencing data were used as a proof of concept, this approach is applicable to any organism with existing functional genomics data sets and may serve as a useful tool for many valuable applications, such as guiding metabolic engineering efforts in other microbes or understanding functional relationships between virulence-associated genes in pathogenic microbes. Furthermore, this work demonstrates that comparison of data obtained from independent component analysis of transcriptomics and gene fitness datasets can elucidate regulatory-functional relationships between genes, which may have utility in a variety of applications, such as metabolic modeling, strain engineering, or identification of antimicrobial drug targets.


Assuntos
Pseudomonas putida , Pseudomonas putida/genética , Redes Reguladoras de Genes , Genômica
4.
mSystems ; 9(3): e0125723, 2024 Mar 19.
Artigo em Inglês | MEDLINE | ID: mdl-38349131

RESUMO

Limosilactobacillus reuteri, a probiotic microbe instrumental to human health and sustainable food production, adapts to diverse environmental shifts via dynamic gene expression. We applied the independent component analysis (ICA) to 117 RNA-seq data sets to decode its transcriptional regulatory network (TRN), identifying 35 distinct signals that modulate specific gene sets. Our findings indicate that the ICA provides a qualitative advancement and captures nuanced relationships within gene clusters that other methods may miss. This study uncovers the fundamental properties of L. reuteri's TRN and deepens our understanding of its arginine metabolism and the co-regulation of riboflavin metabolism and fatty acid conversion. It also sheds light on conditions that regulate genes within a specific biosynthetic gene cluster and allows for the speculation of the potential role of isoprenoid biosynthesis in L. reuteri's adaptive response to environmental changes. By integrating transcriptomics and machine learning, we provide a system-level understanding of L. reuteri's response mechanism to environmental fluctuations, thus setting the stage for modeling the probiotic transcriptome for applications in microbial food production. IMPORTANCE: We have studied Limosilactobacillus reuteri, a beneficial probiotic microbe that plays a significant role in our health and production of sustainable foods, a type of foods that are nutritionally dense and healthier and have low-carbon emissions compared to traditional foods. Similar to how humans adapt their lifestyles to different environments, this microbe adjusts its behavior by modulating the expression of genes. We applied machine learning to analyze large-scale data sets on how these genes behave across diverse conditions. From this, we identified 35 unique patterns demonstrating how L. reuteri adjusts its genes based on 50 unique environmental conditions (such as various sugars, salts, microbial cocultures, human milk, and fruit juice). This research helps us understand better how L. reuteri functions, especially in processes like breaking down certain nutrients and adapting to stressful changes. More importantly, with our findings, we become closer to using this knowledge to improve how we produce more sustainable and healthier foods with the help of microbes.


Assuntos
Limosilactobacillus reuteri , Probióticos , Humanos , Limosilactobacillus reuteri/genética , Perfilação da Expressão Gênica , Transcriptoma/genética , Aprendizado de Máquina
5.
PLoS Comput Biol ; 20(2): e1011865, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38346086

RESUMO

Generalist microbes have adapted to a multitude of environmental stresses through their integrated stress response system. Individual stress responses have been quantified by E. coli metabolism and expression (ME) models under thermal, oxidative and acid stress, respectively. However, the systematic quantification of cross-stress & cross-talk among these stress responses remains lacking. Here, we present StressME: the unified stress response model of E. coli combining thermal (FoldME), oxidative (OxidizeME) and acid (AcidifyME) stress responses. StressME is the most up to date ME model for E. coli and it reproduces all published single-stress ME models. Additionally, it includes refined rate constants to improve prediction accuracy for wild-type and stress-evolved strains. StressME revealed certain optimal proteome allocation strategies associated with cross-stress and cross-talk responses. These stress-optimal proteomes were shaped by trade-offs between protective vs. metabolic enzymes; cytoplasmic vs. periplasmic chaperones; and expression of stress-specific proteins. As StressME is tuned to compute metabolic and gene expression responses under mild acid, oxidative, and thermal stresses, it is useful for engineering and health applications. The modular design of our open-source package also facilitates model expansion (e.g., to new stress mechanisms) by the computational biology community.


Assuntos
Proteínas de Escherichia coli , Escherichia coli , Escherichia coli/metabolismo , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Estresse Fisiológico/genética , Oxirredução , Proteínas de Choque Térmico/metabolismo , Ácidos/metabolismo , Expressão Gênica
6.
PLoS Comput Biol ; 20(1): e1011824, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38252668

RESUMO

The transcriptional regulatory network (TRN) of E. coli consists of thousands of interactions between regulators and DNA sequences. Regulons are typically determined either from resource-intensive experimental measurement of functional binding sites, or inferred from analysis of high-throughput gene expression datasets. Recently, independent component analysis (ICA) of RNA-seq compendia has shown to be a powerful method for inferring bacterial regulons. However, it remains unclear to what extent regulons predicted by ICA structure have a biochemical basis in promoter sequences. Here, we address this question by developing machine learning models that predict inferred regulon structures in E. coli based on promoter sequence features. Models were constructed successfully (cross-validation AUROC > = 0.8) for 85% (40/47) of ICA-inferred E. coli regulons. We found that: 1) The presence of a high scoring regulator motif in the promoter region was sufficient to specify regulatory activity in 40% (19/47) of the regulons, 2) Additional features, such as DNA shape and extended motifs that can account for regulator multimeric binding, helped to specify regulon structure for the remaining 60% of regulons (28/47); 3) investigating regulons where initial machine learning models failed revealed new regulator-specific sequence features that improved model accuracy. Finally, we found that strong regulatory binding sequences underlie both the genes shared between ICA-inferred and experimental regulons as well as genes in the E. coli core pan-regulon of Fur. This work demonstrates that the structure of ICA-inferred regulons largely can be understood through the strength of regulator binding sites in promoter regions, reinforcing the utility of top-down inference for regulon discovery.


Assuntos
Escherichia coli , Regulon , Regulon/genética , Escherichia coli/genética , Escherichia coli/metabolismo , Bactérias/genética , Sítios de Ligação/genética , Regiões Promotoras Genéticas/genética , Regulação Bacteriana da Expressão Gênica/genética , Proteínas de Bactérias/metabolismo
7.
mSystems ; 9(2): e0100123, 2024 Feb 20.
Artigo em Inglês | MEDLINE | ID: mdl-38259168

RESUMO

Understanding the dynamics of biological systems in evolving environments is a challenge due to their scale and complexity. Here, we present a computational framework for the timescale decomposition of biochemical reaction networks to distill essential patterns from their intricate dynamics. This approach identifies timescale hierarchies, concentration pools, and coherent structures from time-series data, providing a system-level description of reaction networks at physiologically important timescales. We apply this technique to kinetic models of hypothetical and biological pathways, validating it by reproducing analytically characterized or previously known concentration pools of these pathways. Moreover, by analyzing the timescale hierarchy of the glycolytic pathway, we elucidate the connections between the stoichiometric and dissipative structures of reaction networks and the temporal organization of coherent structures. Specifically, we show that glycolysis is a cofactor-driven pathway, the slowest dynamics of which are described by a balance between high-energy phosphate bond and redox trafficking. Overall, this approach provides more biologically interpretable characterizations of network dynamics than large-scale kinetic models, thus facilitating model reduction and personalized medicine applications. IMPORTANCE Complex interactions within interconnected biochemical reaction networks enable cellular responses to a wide range of unpredictable environmental perturbations. Understanding how biological functions arise from these intricate interactions has been a long-standing problem in biology. Here, we introduce a computational approach to dissect complex biological systems' dynamics in evolving environments. This approach characterizes the timescale hierarchies of complex reaction networks, offering a system-level understanding at physiologically relevant timescales. Analyzing various hypothetical and biological pathways, we show how stoichiometric properties shape the way energy is dissipated throughout reaction networks. Notably, we establish that glycolysis operates as a cofactor-driven pathway, where the slowest dynamics are governed by a balance between high-energy phosphate bonds and redox trafficking. This approach enhances our understanding of network dynamics and facilitates the development of reduced-order kinetic models with biologically interpretable components.


Assuntos
Fenômenos Fisiológicos Celulares , Glicólise , Cinética , Fosfatos
8.
bioRxiv ; 2024 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-38260479

RESUMO

Mature red blood cells (RBCs) lack mitochondria, and thus exclusively rely on glycolysis to generate adenosine triphosphate (ATP) during aging in vivo and during storage in vitro in the blood bank. Here we identify an association between blood donor age, sex, ethnicity and end of storage levels of glycolytic metabolites in 13,029 volunteers from the Recipient Epidemiology and Donor Evaluation Study. Associations were also observed to ancestry-specific genetic polymorphisms in regions coding for phosphofructokinase 1, platelet - which we detected in mature RBCs - hexokinase 1, and for the ADP-ribosyl cyclase 1 and 2 (CD38/BST1). Gene-metabolite associations were validated in fresh and stored RBCs from 525 Diversity Outbred mice, and via multi-omics characterization of 1,929 samples from 643 RBC units during storage. ATP levels, breakdown and deamination into hypoxanthine were associated with hemolysis in vitro and in vivo, both in healthy autologous transfusion recipients and in 4,700 heterologous critically ill patients. Highlights: Blood donor age and sex affect glycolysis in stored RBCs from 13,029 volunteers;Ancestry-based genetic polymorphisms in PFKP and CD38/BST1 influence glycolysis in stored human and fresh or stored murine RBCs;PFKP is detected in pure mature RBCs and boosts glycolytic fluxes when ATP is low, such as in stored RBCs;ATP and hypoxanthine associate with RBC hemolysis in vitro and in vivo in mice and thousands of transfusion recipients.

9.
mSystems ; 9(2): e0060623, 2024 Feb 20.
Artigo em Inglês | MEDLINE | ID: mdl-38189271

RESUMO

Acinetobacter baumannii causes severe infections in humans, resists multiple antibiotics, and survives in stressful environmental conditions due to modulations of its complex transcriptional regulatory network (TRN). Unfortunately, our global understanding of the TRN in this emerging opportunistic pathogen is limited. Here, we apply independent component analysis, an unsupervised machine learning method, to a compendium of 139 RNA-seq data sets of three multidrug-resistant A. baumannii international clonal complex I strains (AB5075, AYE, and AB0057). This analysis allows us to define 49 independently modulated gene sets, which we call iModulons. Analysis of the identified A. baumannii iModulons reveals validating parallels to previously defined biological operons/regulons and provides a framework for defining unknown regulons. By utilizing the iModulons, we uncover potential mechanisms for a RpoS-independent general stress response, define global stress-virulence trade-offs, and identify conditions that may induce plasmid-borne multidrug resistance. The iModulons provide a model of the TRN that emphasizes the importance of transcriptional regulation of virulence phenotypes in A. baumannii. Furthermore, they suggest the possibility of future interventions to guide gene expression toward diminished pathogenic potential.IMPORTANCEThe rise in hospital outbreaks of multidrug-resistant Acinetobacter baumannii infections underscores the urgent need for alternatives to traditional broad-spectrum antibiotic therapies. The success of A. baumannii as a significant nosocomial pathogen is largely attributed to its ability to resist antibiotics and survive environmental stressors. However, there is limited literature available on the global, complex regulatory circuitry that shapes these phenotypes. Computational tools that can assist in the elucidation of A. baumannii's transcriptional regulatory network architecture can provide much-needed context for a comprehensive understanding of pathogenesis and virulence, as well as for the development of targeted therapies that modulate these pathways.


Assuntos
Infecções por Acinetobacter , Acinetobacter baumannii , Humanos , Acinetobacter baumannii/genética , Infecções por Acinetobacter/tratamento farmacológico , Virulência/genética , Regulação da Expressão Gênica , Antibacterianos/farmacologia
10.
Nat Commun ; 14(1): 7370, 2023 11 14.
Artigo em Inglês | MEDLINE | ID: mdl-37963869

RESUMO

Functional annotation of open reading frames in microbial genomes remains substantially incomplete. Enzymes constitute the most prevalent functional gene class in microbial genomes and can be described by their specific catalytic functions using the Enzyme Commission (EC) number. Consequently, the ability to predict EC numbers could substantially reduce the number of un-annotated genes. Here we present a deep learning model, DeepECtransformer, which utilizes transformer layers as a neural network architecture to predict EC numbers. Using the extensively studied Escherichia coli K-12 MG1655 genome, DeepECtransformer predicted EC numbers for 464 un-annotated genes. We experimentally validated the enzymatic activities predicted for three proteins (YgfF, YciO, and YjdM). Further examination of the neural network's reasoning process revealed that the trained neural network relies on functional motifs of enzymes to predict EC numbers. Thus, DeepECtransformer is a method that facilitates the functional annotation of uncharacterized genes.


Assuntos
Aprendizado Profundo , Escherichia coli K12 , Escherichia coli K12/genética , Proteínas/genética , Genoma , Escherichia coli/genética , Anotação de Sequência Molecular , Fases de Leitura Aberta
11.
Nat Commun ; 14(1): 7690, 2023 Nov 24.
Artigo em Inglês | MEDLINE | ID: mdl-38001096

RESUMO

Surveillance programs for managing antimicrobial resistance (AMR) have yielded thousands of genomes suited for data-driven mechanism discovery. We present a workflow integrating pangenomics, gene annotation, and machine learning to identify AMR genes at scale. When applied to 12 species, 27,155 genomes, and 69 drugs, we 1) find AMR gene transfer mostly confined within related species, with 925 genes in multiple species but just eight in multiple phylogenetic classes, 2) demonstrate that discovery-oriented support vector machines outperform contemporary methods at recovering known AMR genes, recovering 263 genes compared to 145 by Pyseer, and 3) identify 142 AMR gene candidates. Validation of two candidates in E. coli BW25113 reveals cases of conditional resistance: ΔcycA confers ciprofloxacin resistance in minimal media with D-serine, and frdD V111D confers ampicillin resistance in the presence of ampC by modifying the overlapping promoter. We expect this approach to be adaptable to other species and phenotypes.


Assuntos
Antibacterianos , Escherichia coli , Antibacterianos/farmacologia , Escherichia coli/genética , Farmacorresistência Bacteriana/genética , Filogenia , Ciprofloxacina/farmacologia
12.
Metabolites ; 13(11)2023 Nov 03.
Artigo em Inglês | MEDLINE | ID: mdl-37999223

RESUMO

Pathway analysis is ubiquitous in biological data analysis due to the ability to integrate small simultaneous changes in functionally related components. While pathways are often defined based on either manual curation or network topological properties, an attractive alternative is to generate pathways around specific functions, in which metabolism can be defined as the production and consumption of specific metabolites. In this work, we present an algorithm, termed MetPath, that calculates pathways for condition-specific production and consumption of specific metabolites. We demonstrate that these pathways have several useful properties. Pathways calculated in this manner (1) take into account the condition-specific metabolic role of a gene product, (2) are localized around defined metabolic functions, and (3) quantitatively weigh the importance of expression to a function based on the flux contribution of the gene product. We demonstrate how these pathways elucidate network interactions between genes across different growth conditions and between cell types. Furthermore, the calculated pathways compare favorably to manually curated pathways in predicting the expression correlation between genes. To facilitate the use of these pathways, we have generated a large compendium of pathways under different growth conditions for E. coli. The MetPath algorithm provides a useful tool for metabolic network-based statistical analyses of high-throughput data.

13.
Metabolites ; 13(11)2023 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-37999241

RESUMO

Red blood cells (RBCs) are abundant (more than 80% of the total cells in the human body), yet relatively simple, as they lack nuclei and organelles, including mitochondria. Since the earliest days of biochemistry, the accessibility of blood and RBCs made them an ideal matrix for the characterization of metabolism. Because of this, investigations into RBC metabolism are of extreme relevance for research and diagnostic purposes in scientific and clinical endeavors. The relative simplicity of RBCs has made them an eligible model for the development of reconstruction maps of eukaryotic cell metabolism since the early days of systems biology. Computational models hold the potential to deepen knowledge of RBC metabolism, but also and foremost to predict in silico RBC metabolic behaviors in response to environmental stimuli. Here, we review now classic concepts on RBC metabolism, prior work in systems biology of unicellular organisms, and how this work paved the way for the development of reconstruction models of RBC metabolism. Translationally, we discuss how the fields of metabolomics and systems biology have generated evidence to advance our understanding of the RBC storage lesion, a process of decline in storage quality that impacts over a hundred million blood units transfused every year.

14.
bioRxiv ; 2023 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-37662221

RESUMO

Understanding the dynamics of biological systems in evolving environments is a challenge due to their scale and complexity. Here, we present a computational framework for timescale decomposition of biochemical reaction networks to distill essential patterns from their intricate dynamics. This approach identifies timescale hierarchies, concentration pools, and coherent structures from time-series data, providing a system-level description of reaction networks at physiologically important timescales. We apply this technique to kinetic models of hypothetical and biological pathways, validating it by reproducing analytically characterized or previously known concentration pools of these pathways. Moreover, by analyzing the timescale hierarchy of the glycolytic pathway, we elucidate the connections between the stoichiometric and dissipative structures of reaction networks and the temporal organization of coherent structures. Specifically, we show that glycolysis is a cofactor driven pathway, the slowest dynamics of which are described by a balance between high-energy phosphate bond and redox trafficking. Overall, this approach provides more biologically interpretable characterizations of network dynamics than large-scale kinetic models, thus facilitating model reduction and personalized medicine applications.

15.
Cell Rep ; 42(9): 113105, 2023 09 26.
Artigo em Inglês | MEDLINE | ID: mdl-37713311

RESUMO

Relationships between the genome, transcriptome, and metabolome underlie all evolved phenotypes. However, it has proved difficult to elucidate these relationships because of the high number of variables measured. A recently developed data analytic method for characterizing the transcriptome can simplify interpretation by grouping genes into independently modulated sets (iModulons). Here, we demonstrate how iModulons reveal deep understanding of the effects of causal mutations and metabolic rewiring. We use adaptive laboratory evolution to generate E. coli strains that tolerate high levels of the redox cycling compound paraquat, which produces reactive oxygen species (ROS). We combine resequencing, iModulons, and metabolic models to elucidate six interacting stress-tolerance mechanisms: (1) modification of transport, (2) activation of ROS stress responses, (3) use of ROS-sensitive iron regulation, (4) motility, (5) broad transcriptional reallocation toward growth, and (6) metabolic rewiring to decrease NADH production. This work thus demonstrates the power of iModulon knowledge mapping for evolution analysis.


Assuntos
Escherichia coli , Paraquat , Paraquat/farmacologia , Espécies Reativas de Oxigênio/metabolismo , Escherichia coli/metabolismo , Transcriptoma/genética , Perfilação da Expressão Gênica
16.
Nucleic Acids Res ; 51(19): 10176-10193, 2023 10 27.
Artigo em Inglês | MEDLINE | ID: mdl-37713610

RESUMO

Transcriptomic data is accumulating rapidly; thus, scalable methods for extracting knowledge from this data are critical. Here, we assembled a top-down expression and regulation knowledge base for Escherichia coli. The expression component is a 1035-sample, high-quality RNA-seq compendium consisting of data generated in our lab using a single experimental protocol. The compendium contains diverse growth conditions, including: 9 media; 39 supplements, including antibiotics; 42 heterologous proteins; and 76 gene knockouts. Using this resource, we elucidated global expression patterns. We used machine learning to extract 201 modules that account for 86% of known regulatory interactions, creating the regulatory component. With these modules, we identified two novel regulons and quantified systems-level regulatory responses. We also integrated 1675 curated, publicly-available transcriptomes into the resource. We demonstrated workflows for analyzing new data against this knowledge base via deconstruction of regulation during aerobic transition. This resource illuminates the E. coli transcriptome at scale and provides a blueprint for top-down transcriptomic analysis of non-model organisms.


Assuntos
Escherichia coli , Bases de Conhecimento , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Perfilação da Expressão Gênica , Regulação Bacteriana da Expressão Gênica , Transcriptoma
17.
iScience ; 26(9): 107500, 2023 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-37636038

RESUMO

The bacterial strain JCVI-syn3.0 stands as the first example of a living organism with a minimized synthetic genome, derived from the Mycoplasma mycoides genome and chemically synthesized in vitro. Here, we report the experimental evolution of a syn3.0- derived strain. Ten independent replicates were evolved for several hundred generations, leading to growth rate improvements of > 15%. Endpoint strains possessed an average of 8 mutations composed of indels and SNPs, with a pronounced C/G- > A/T transversion bias. Multiple genes were repeated mutational targets across the independent lineages, including phase variable lipoprotein activation, 5 distinct; nonsynonymous substitutions in the same membrane transporter protein, and inactivation of an uncharacterized gene. Transcriptomic analysis revealed an overall tradeoff reflected in upregulated ribosomal proteins and downregulated DNA and RNA related proteins during adaptation. This work establishes the suitability of synthetic, minimal strains for laboratory evolution, providing a means to optimize strain growth characteristics and elucidate gene functionality.

18.
mSystems ; 8(5): e0043723, 2023 Oct 26.
Artigo em Inglês | MEDLINE | ID: mdl-37638727

RESUMO

IMPORTANCE: Pseudomonas syringae pv. tomato DC3000 is a model plant pathogen that infects tomatoes and Arabidopsis thaliana. The current understanding of global transcriptional regulation in the pathogen is limited. Here, we applied iModulon analysis to a compendium of RNA-seq data to unravel its transcriptional regulatory network. We characterize each co-regulated gene set, revealing the activity of major regulators across diverse conditions. We provide new insights on the transcriptional dynamics in interactions with the plant immune system and with other bacterial species, such as AlgU-dependent regulation of flagellar genes during plant infection and downregulation of siderophore production in the presence of a siderophore cheater. This study demonstrates the novel application of iModulons in studying temporal dynamics during host-pathogen and microbe-microbe interactions, and reveals specific insights of interest.


Assuntos
Arabidopsis , Microbiota , Pseudomonas syringae/genética , Proteínas de Bactérias/genética , Transcriptoma/genética , Arabidopsis/genética , Aprendizado de Máquina , Sideróforos
19.
Genome Biol ; 24(1): 183, 2023 08 08.
Artigo em Inglês | MEDLINE | ID: mdl-37553643

RESUMO

BACKGROUND: Cumulative sequencing efforts have yielded enough genomes to construct pangenomes for dozens of bacterial species and elucidate intraspecies gene conservation. Given the diversity of organisms for which this is achievable, similar analyses for ancestral species are feasible through the integration of pangenomics and phylogenetics, promising deeper insights into the nature of ancient life. RESULTS: We construct pangenomes for 183 bacterial species from 54,085 genomes and identify their core genomes using a novel statistical model to estimate genome-specific error rates and underlying gene frequencies. The core genomes are then integrated into a phylogenetic tree to reconstruct the core genome of the last bacterial common ancestor (LBCA), yielding three main results: First, the gene content of modern and ancestral core genomes are diverse at the level of individual genes but are similarly distributed by functional category and share several poorly characterized genes. Second, the LBCA core genome is distinct from any individual modern core genome but has many fundamental biological systems intact, especially those involving translation machinery and biosynthetic pathways to all major nucleotides and amino acids. Third, despite this metabolic versatility, the LBCA core genome likely requires additional non-core genes for viability, based on comparisons with the minimal organism, JCVI-Syn3A. CONCLUSIONS: These results suggest that many cellular systems commonly conserved in modern bacteria were not just present in ancient bacteria but were nearly immutable with respect to short-term intraspecies variation. Extending this analysis to other domains of life will likely provide similar insights into more distant ancestral species.


Assuntos
Evolução Molecular , Genoma , Filogenia , Frequência do Gene , Bactérias/genética , Genoma Bacteriano
20.
Food Microbiol ; 115: 104334, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37567624

RESUMO

Lactobacillaceae represent a large family of important microbes that are foundational to the food industry. Many genome sequences of Lactobacillaceae strains are now available, enabling us to conduct a comprehensive pangenome analysis of this family. We collected 3591 high-quality genomes from public sources and found that: 1) they contained enough genomes for 26 species to perform a pangenomic analysis, 2) the normalized Heap's coefficient λ (a measure of pangenome openness) was found to have an average value of 0.27 (ranging from 0.07 to 0.37), 3) the pangenome openness was correlated with the abundance and genomic location of transposons and mobilomes, 4) the pangenome for each species was divided into core, accessory, and rare genomes, that highlight the species-specific properties (such as motility and restriction-modification systems), 5) the pangenome of Lactiplantibacillus plantarum (which contained the highest number of genomes found amongst the 26 species studied) contained nine distinct phylogroups, and 6) genome mining revealed a richness of detected biosynthetic gene clusters, with functions ranging from antimicrobial and probiotic to food preservation, but ∼93% were of unknown function. This study provides the first in-depth comparative pangenomics analysis of the Lactobacillaceae family.


Assuntos
Genômica , Lactobacillaceae , Filogenia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...